Skip to content

Build CUDA + OSU-Micro-Benchmarks GPU software for supported combinations of CPU and CUDA compute capability 90 #1077

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Conversation

TopRichard
Copy link
Collaborator

@TopRichard TopRichard commented May 6, 2025

  •  x86_64_generic
  •  cascadelake
  •  haswell
  •  icelake
  •  sapphirerapids
  •  skylake
  •  zen2
  •  zen3
  •  zen4 exists/built on gpu-node but missing CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1 and other software
  •  aarch64_generic
  •  neoverse_n1
  •  neoverse_v1
  •  nvidia/grace exist/built on gpu-node

…ions of CPU and CUDA compute capability 90
@TopRichard TopRichard added 2023.06-software.eessi.io 2023.06 version of software.eessi.io accel:nvidia labels May 6, 2025
Copy link

eessi-bot bot commented May 6, 2025

Instance eessi-bot-mc-aws is configured to build for:

  • architectures: x86_64/generic, x86_64/intel/haswell, x86_64/intel/sapphirerapids, x86_64/intel/skylake_avx512, x86_64/intel/cascadelake, x86_64/intel/icelake, x86_64/amd/zen2, x86_64/amd/zen3, aarch64/generic, aarch64/neoverse_n1, aarch64/neoverse_v1
  • repositories: eessi.io-2023.06-compat, eessi.io-2023.06-software

Copy link

eessi-bot bot commented May 6, 2025

Instance eessi-bot-mc-azure is configured to build for:

  • architectures: x86_64/amd/zen4
  • repositories: eessi.io-2023.06-compat, eessi.io-2023.06-software

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented May 6, 2025

Instance eessi-bot-vsc-ugent is configured to build for:

  • architectures: x86_64/amd/zen3
  • repositories: eessi-hpc.org-2023.06-software, eessi.io-2023.06-compat, eessi-hpc.org-2023.06-compat, eessi.io-2023.06-software

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented May 6, 2025

Instance eessi-bot-surf is configured to build for:

  • architectures: x86_64/amd/zen4, x86_64/amd/zen2
  • repositories: eessi-hpc.org-2023.06-software, eessi.io-2023.06-software, eessi.io-2023.06-compat, eessi-hpc.org-2023.06-compat

@eessi-bot-toprichard
Copy link

Instance rt-Grace-jr is configured to build for:

  • architectures: aarch64/nvidia/grace
  • repositories: eessi.io-2023.06-software

@TopRichard TopRichard marked this pull request as draft May 6, 2025 12:07
@TopRichard
Copy link
Collaborator Author

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/generic accel:nvidia/cc90

Copy link

eessi-bot bot commented May 6, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented May 6, 2025

Updates by the bot instance eessi-bot-vsc-ugent (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/generic accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/generic accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/generic accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented May 6, 2025

Updates by the bot instance eessi-bot-surf (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/generic accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/generic accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/generic accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted

@eessi-bot-toprichard
Copy link

eessi-bot-toprichard bot commented May 6, 2025

Updates by the bot instance rt-Grace-jr (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/generic accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/generic accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/generic accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted

Copy link

eessi-bot bot commented May 6, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/generic accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/generic accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/generic accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented May 6, 2025

Label bot:build has been set by user TopRichard, but this person does not have permission to trigger builds

Copy link

eessi-bot bot commented May 6, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-generic and accelerator nvidia/cc90 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.05/pr_1077/61254

date job status comment
May 06 12:13:19 UTC 2025 submitted job id 61254 awaits release by job manager
May 06 12:14:26 UTC 2025 released job awaits launch by Slurm scheduler
May 06 12:20:29 UTC 2025 running job 61254 is running
May 06 13:20:26 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-61254.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-generic-17465358210.tar.gzsize: 4944 MiB (5184355656 bytes)
entries: 12575
modules under 2023.06/software/linux/x86_64/generic/accel/nvidia/cc90/modules/all
CUDA/12.1.1.lua
CUDA/12.4.0.lua
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1.lua
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1.lua
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0.lua
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0.lua
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0.lua
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1.lua
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0.lua
software under 2023.06/software/linux/x86_64/generic/accel/nvidia/cc90/software
CUDA/12.1.1
CUDA/12.4.0
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0
other under 2023.06/software/linux/x86_64/generic/accel/nvidia/cc90
no other files in tarball
May 06 13:20:26 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-61254.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
May 12 10:18:34 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-generic-17465358210.tar.gz to S3 bucket succeeded
May 12 13:55:45 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-generic-17465358210.tar.gz to S3 bucket succeeded

@TopRichard
Copy link
Collaborator Author

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen2 accel:nvidia/cc90
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen3 accel:nvidia/cc90
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc90
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/haswell accel:nvidia/cc90
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc90
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/skylake_avx512 accel:nvidia/cc90
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/sapphirerapids accel:nvidia/cc90
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/generic accel:nvidia/cc90
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_n1 accel:nvidia/cc90
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_v accel:nvidia/cc90

Copy link

eessi-bot bot commented May 6, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented May 6, 2025

Updates by the bot instance eessi-bot-surf (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen2 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen2 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen3 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen3 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/haswell accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/haswell accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/skylake_avx512 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/skylake_avx512 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/sapphirerapids accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/sapphirerapids accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/generic accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/generic accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_n1 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_n1 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_v accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_v accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen2 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen3 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/haswell accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/skylake_avx512 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/sapphirerapids accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/generic accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_n1 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_v accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented May 6, 2025

Updates by the bot instance eessi-bot-vsc-ugent (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen2 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen2 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen3 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen3 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/haswell accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/haswell accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/skylake_avx512 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/skylake_avx512 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/sapphirerapids accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/sapphirerapids accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/generic accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/generic accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_n1 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_n1 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_v accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_v accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen2 accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen3 accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/haswell accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/skylake_avx512 accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/sapphirerapids accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/generic accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_n1 accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_v accelerator:nvidia/cc90 resulted in:

    • account TopRichard has NO permission to submit build jobs

@eessi-bot-toprichard
Copy link

eessi-bot-toprichard bot commented May 6, 2025

Updates by the bot instance rt-Grace-jr (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen2 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen2 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen3 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen3 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/haswell accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/haswell accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/skylake_avx512 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/skylake_avx512 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/sapphirerapids accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/sapphirerapids accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/generic accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/generic accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_n1 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_n1 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_v accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_v accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen2 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen3 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/haswell accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/skylake_avx512 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/sapphirerapids accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/generic accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_n1 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_v accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted

Copy link

eessi-bot bot commented May 6, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen2 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen2 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen3 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen3 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/haswell accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/haswell accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/skylake_avx512 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/skylake_avx512 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/sapphirerapids accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/sapphirerapids accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/generic accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/generic accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_n1 accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_n1 accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:aarch64/neoverse_v accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_v accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen2 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/amd/zen3 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/haswell accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/skylake_avx512 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/sapphirerapids accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/generic accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_n1 accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:aarch64/neoverse_v accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented May 6, 2025

Label bot:build has been set by user TopRichard, but this person does not have permission to trigger builds

Copy link

eessi-bot bot commented May 6, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen2 and accelerator nvidia/cc90 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.05/pr_1077/61262

date job status comment
May 06 14:49:02 UTC 2025 submitted job id 61262 awaits release by job manager
May 06 14:49:25 UTC 2025 released job awaits launch by Slurm scheduler
May 06 14:54:47 UTC 2025 running job 61262 is running
May 06 15:58:03 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-61262.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-17465452850.tar.gzsize: 4944 MiB (5184473326 bytes)
entries: 12575
modules under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc90/modules/all
CUDA/12.1.1.lua
CUDA/12.4.0.lua
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1.lua
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1.lua
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0.lua
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0.lua
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0.lua
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1.lua
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0.lua
software under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc90/software
CUDA/12.1.1
CUDA/12.4.0
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0
other under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc90
no other files in tarball
May 06 15:58:03 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-61262.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
May 12 10:20:28 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen2-17465452850.tar.gz to S3 bucket succeeded
May 12 13:57:53 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen2-17465452850.tar.gz to S3 bucket succeeded

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented May 6, 2025

Label bot:build has been set by user TopRichard, but this person does not have permission to trigger builds

Copy link

eessi-bot bot commented May 6, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen3 and accelerator nvidia/cc90 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.05/pr_1077/61263

date job status comment
May 06 14:49:07 UTC 2025 submitted job id 61263 awaits release by job manager
May 06 14:49:28 UTC 2025 released job awaits launch by Slurm scheduler
May 06 14:54:50 UTC 2025 running job 61263 is running
May 06 15:46:50 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-61263.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-17465449140.tar.gzsize: 4944 MiB (5184437359 bytes)
entries: 12575
modules under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc90/modules/all
CUDA/12.1.1.lua
CUDA/12.4.0.lua
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1.lua
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1.lua
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0.lua
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0.lua
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0.lua
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1.lua
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0.lua
software under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc90/software
CUDA/12.1.1
CUDA/12.4.0
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0
other under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc90
no other files in tarball
May 06 15:46:50 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-61263.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
May 12 10:22:21 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen3-17465449140.tar.gz to S3 bucket succeeded
May 12 14:00:11 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen3-17465449140.tar.gz to S3 bucket succeeded

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented May 6, 2025

Label bot:build has been set by user TopRichard, but this person does not have permission to trigger builds

Copy link

eessi-bot bot commented May 6, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-intel-cascadelake and accelerator nvidia/cc90 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.05/pr_1077/61264

date job status comment
May 06 14:49:17 UTC 2025 submitted job id 61264 awaits release by job manager
May 06 14:49:30 UTC 2025 released job awaits launch by Slurm scheduler
May 06 14:56:12 UTC 2025 running job 61264 is running
May 06 16:10:08 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-61264.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-cascadelake-17465454580.tar.gzsize: 4944 MiB (5184583567 bytes)
entries: 12598
modules under 2023.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc90/modules/all
CUDA/12.1.1.lua
CUDA/12.4.0.lua
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1.lua
GDRCopy/2.4-GCCcore-13.2.0.lua
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1.lua
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0.lua
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0.lua
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0.lua
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1.lua
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0.lua
software under 2023.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc90/software
CUDA/12.1.1
CUDA/12.4.0
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1
GDRCopy/2.4-GCCcore-13.2.0
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0
other under 2023.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc90
no other files in tarball
May 06 16:10:08 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-61264.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented May 6, 2025

Label bot:build has been set by user TopRichard, but this person does not have permission to trigger builds

Copy link

eessi-bot bot commented May 8, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented May 8, 2025

Updates by the bot instance eessi-bot-surf (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted

@eessi-bot-toprichard
Copy link

eessi-bot-toprichard bot commented May 8, 2025

Updates by the bot instance rt-Grace-jr (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc90 from TopRichard

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc90 resulted in:

    • no jobs were submitted

Copy link

eessi-bot bot commented May 8, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-intel-cascadelake and accelerator nvidia/cc90 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.05/pr_1077/61713

date job status comment
May 08 18:10:49 UTC 2025 submitted job id 61713 awaits release by job manager
May 08 18:11:51 UTC 2025 released job awaits launch by Slurm scheduler
May 08 18:16:54 UTC 2025 running job 61713 is running
May 08 19:32:26 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-61713.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-cascadelake-17467303640.tar.gzsize: 4944 MiB (5184602169 bytes)
entries: 12598
modules under 2023.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc90/modules/all
CUDA/12.1.1.lua
CUDA/12.4.0.lua
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1.lua
GDRCopy/2.4-GCCcore-13.2.0.lua
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1.lua
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0.lua
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0.lua
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0.lua
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1.lua
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0.lua
software under 2023.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc90/software
CUDA/12.1.1
CUDA/12.4.0
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1
GDRCopy/2.4-GCCcore-13.2.0
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0
other under 2023.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc90
no other files in tarball
May 08 19:32:26 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-61713.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

Copy link

eessi-bot bot commented May 8, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-intel-icelake and accelerator nvidia/cc90 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.05/pr_1077/61714

date job status comment
May 08 18:10:54 UTC 2025 submitted job id 61714 awaits release by job manager
May 08 18:11:54 UTC 2025 released job awaits launch by Slurm scheduler
May 08 18:17:05 UTC 2025 running job 61714 is running
May 08 19:17:44 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-61714.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-icelake-17467299640.tar.gzsize: 4944 MiB (5184585543 bytes)
entries: 12598
modules under 2023.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc90/modules/all
CUDA/12.1.1.lua
CUDA/12.4.0.lua
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1.lua
GDRCopy/2.4-GCCcore-13.2.0.lua
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1.lua
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0.lua
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0.lua
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0.lua
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1.lua
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0.lua
software under 2023.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc90/software
CUDA/12.1.1
CUDA/12.4.0
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1
GDRCopy/2.4-GCCcore-13.2.0
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0
other under 2023.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc90
no other files in tarball
May 08 19:17:44 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-61714.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator

Hm, this version of GDRcopy wasn't in there yet, it is now, so rebuilding again

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc80
bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc80

Copy link

eessi-bot bot commented May 12, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)

@eessi-bot-toprichard
Copy link

Updates by the bot instance rt-Grace-jr (click for details)
  • account casparvl has NO permission to send commands to the bot

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented May 12, 2025

Updates by the bot instance eessi-bot-surf (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc80
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc80
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc80 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc80 resulted in:

    • no jobs were submitted

Copy link

eessi-bot bot commented May 12, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-intel-cascadelake and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.05/pr_1077/62577

date job status comment
May 12 08:01:43 UTC 2025 submitted job id 62577 awaits release by job manager
May 12 08:01:51 UTC 2025 released job awaits launch by Slurm scheduler
May 12 08:08:03 UTC 2025 running job 62577 is running
May 12 09:23:08 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-62577.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-cascadelake-17470394110.tar.gzsize: 4917 MiB (5156016798 bytes)
entries: 12575
modules under 2023.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc80/modules/all
CUDA/12.1.1.lua
CUDA/12.4.0.lua
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1.lua
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1.lua
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0.lua
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0.lua
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0.lua
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1.lua
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0.lua
software under 2023.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc80/software
CUDA/12.1.1
CUDA/12.4.0
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0
other under 2023.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc80
no other files in tarball
May 12 09:23:08 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-62577.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
May 12 10:24:15 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-cascadelake-17470394110.tar.gz to S3 bucket succeeded
May 12 14:02:25 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-cascadelake-17470394110.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented May 12, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-intel-icelake and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.05/pr_1077/62579

date job status comment
May 12 08:01:48 UTC 2025 submitted job id 62579 awaits release by job manager
May 12 08:01:57 UTC 2025 released job awaits launch by Slurm scheduler
May 12 08:09:13 UTC 2025 running job 62579 is running
May 12 09:10:50 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-62579.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-icelake-17470390950.tar.gzsize: 4917 MiB (5155987214 bytes)
entries: 12575
modules under 2023.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
CUDA/12.1.1.lua
CUDA/12.4.0.lua
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1.lua
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1.lua
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0.lua
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0.lua
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0.lua
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1.lua
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0.lua
software under 2023.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
CUDA/12.1.1
CUDA/12.4.0
CUDA-Samples/12.1-GCC-12.3.0-CUDA-12.1.1
NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1
NCCL/2.20.5-GCCcore-13.2.0-CUDA-12.4.0
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
OSU-Micro-Benchmarks/7.5-gompi-2023b-CUDA-12.4.0
UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
UCC-CUDA/1.2.0-GCCcore-13.2.0-CUDA-12.4.0
UCX-CUDA/1.14.1-GCCcore-12.3.0-CUDA-12.1.1
UCX-CUDA/1.15.0-GCCcore-13.2.0-CUDA-12.4.0
other under 2023.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
no other files in tarball
May 12 09:10:50 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-62579.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
May 12 10:28:15 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-icelake-17470390950.tar.gz to S3 bucket succeeded
May 12 14:06:58 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-icelake-17470390950.tar.gz to S3 bucket succeeded

@casparvl casparvl added bot:deploy Ask bot to deploy missing software installations to EESSI and removed ready-to-review labels May 12, 2025
@eessi-bot-toprichard
Copy link

Label bot:deploy has been set by user casparvl, but this person does not have permission to trigger deployments

@TopRichard TopRichard added bot:deploy Ask bot to deploy missing software installations to EESSI and removed bot:deploy Ask bot to deploy missing software installations to EESSI labels May 12, 2025
Copy link

eessi-bot bot commented May 12, 2025

Label bot:deploy has been set by user TopRichard, which has no permission to trigger the action

@eessi-bot-deucalion
Copy link

Label bot:deploy has been set by user TopRichard, but this person does not have permission to trigger deployments

Copy link

eessi-bot bot commented May 12, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/cascadelake accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc80
  • received bot command build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/intel/icelake accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc80
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/cascadelake accelerator:nvidia/cc80 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software instance:eessi-bot-mc-aws architecture:x86_64/intel/icelake accelerator:nvidia/cc80 resulted in:

    • no jobs were submitted

Copy link

eessi-bot bot commented May 12, 2025

Label bot:deploy has been set by user TopRichard, which has no permission to trigger the action

@casparvl casparvl added bot:deploy Ask bot to deploy missing software installations to EESSI and removed bot:deploy Ask bot to deploy missing software installations to EESSI labels May 12, 2025
@eessi-bot-toprichard
Copy link

Label bot:deploy has been set by user casparvl, but this person does not have permission to trigger deployments

@casparvl
Copy link
Collaborator

Still seems to be missing Grace, @TopRichard will have a look at it once he has time. Probably just try to reset the label first, maybe the tarball upload failed?

@casparvl
Copy link
Collaborator

As @TopRichard correctly pointed out, this was deployed for Grace already earlier. So no need to do so here. Merging!

@casparvl casparvl merged commit 0e5993a into EESSI:2023.06-software.eessi.io May 12, 2025
63 checks passed
Copy link

eessi-bot bot commented May 12, 2025

PR merged! Moved ['/project/def-users/SHARED/jobs/2025.05/pr_1077/61254', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61262', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61263', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61264', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61265', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61266', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61267', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61268', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61269', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61270', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61271', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61705', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61706', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61713', '/project/def-users/SHARED/jobs/2025.05/pr_1077/61714', '/project/def-users/SHARED/jobs/2025.05/pr_1077/62577', '/project/def-users/SHARED/jobs/2025.05/pr_1077/62579'] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2025.05.12

Copy link

eessi-bot bot commented May 12, 2025

PR merged! Moved ['/project/def-users/SHARED/jobs/2025.05/pr_1077/2517'] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2025.05.12

@casparvl
Copy link
Collaborator

casparvl commented May 14, 2025

Darn, I discovered that for icelake and cascadelake, I accidentally built for cc80 in this PR, instead of cc90. This can be seen by checking the prefix for the files in the staging PR https://github.com/EESSI/staging/pull/2818 , which include cc80 instead of cc90.

It's not a big deal, but it means essentially that this PR hasn't been deployed (yet) for icelake + cc90 and cascadelake + cc90.

This will fix itself when we deploy the next PR for cc90, but may make that look a bit unexpected, as it will also install all the basic CUDA stuff from this PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2023.06-software.eessi.io 2023.06 version of software.eessi.io accel:nvidia bot:deploy Ask bot to deploy missing software installations to EESSI
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants